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Abstract — A  channel-coded  physical-layer  network  coding 
strategy  is  refined  for  practical  operation.  The  system  uses 
frequency-shift  keying  (FSK)  modulation  and  operates  nonco- 
herently,  providing  advantages  over  coherent  operation:  there 
are  no  requirements  for  perfect  power  control,  phase  synchro¬ 
nism,  or  estimates  of  carrier-phase  offset.  In  contrast  with 
analog  network  coding,  which  relays  received  analog  signals  plus 
noise,  the  system  relays  digital  network  codewords,  obtained  by 
digital  demodulation  and  channel  decoding  at  the  relay.  The 
emphasis  of  this  paper  is  on  the  relay  receiver  formulation. 
Closed-form  expressions  are  derived  that  provide  bitwise  log- 
likelihood  ratios,  which  may  be  passed  through  a  standard 
error-correction  decoder.  The  role  of  fading-amplitude  estimates 
is  investigated,  and  an  effective  fading-amplitude  estimator  is 
developed.  Simulation  results  are  presented  for  a  Rayleigh  block¬ 
fading  channel,  and  the  influence  of  block  length  is  explored. 
An  example  realization  of  the  proposed  system  demonstrates  a 
32.4%  throughput  improvement  compared  to  a  similar  system 
that  performs  network  coding  at  the  link  layer.  By  properly 
selecting  the  rates  of  the  channel  codes,  this  benefit  may  be 
achieved  without  requiring  an  increase  in  transmit  power. 

Index  Terms — Network  coding,  two-way  relay  channel, 
frequency-shift  keying,  noncoherent  reception,  channel  estima¬ 
tion. 

I.  Introduction 

IN  the  two-way  relay  channel  (TWRC),  a  pair  of  source 
terminals  exchange  information  through  an  intermediate 
relay  without  a  direct  link  between  the  sources  [1].  The 
exchange  can  occur  in  two,  three,  or  four  orthogonal  time 
slots,  depending  on  how  the  information  is  encoded  [2],  With 
a  traditional  transmission  scheduling  scheme,  the  exchange 
requires  four  slots.  In  each  of  the  first  two  slots,  one  of  the 
terminals  transmits  a  packet  to  the  relay,  while  in  each  of 
the  last  two  slots,  the  relay  transmits  a  packet  to  each  of  the 
terminals.  By  using  network  coding  [3],  the  number  of  slots 
can  be  reduced.  With  link-layer  network  coding  (LNC),  the 
third  and  fourth  slots  are  combined  into  one  slot  by  having 
the  relay  add  (modulo-2)  the  packets  that  it  receives  from  the 
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Fig.  1.  (a)  Link-layer  network  coding,  and  (b)  Physical-layer  network  coding. 

two  terminals.  During  the  third  step,  the  relay  sends  the  sum 
of  the  two  packets,  and  each  terminal  is  able  to  recover  the 
information  from  the  other  terminal  by  subtracting  (or  adding, 
modulo-2)  its  own  packet  from  the  received  signal.  With 
physical-layer  network  coding  (PNC),  the  first  two  slots  are 
combined  by  having  the  two  terminals  transmit  their  packets  at 
the  same  time  [2].  The  relay  receives  a  combination  of  both 
modulated  packets  during  the  first  slot,  which  it  broadcasts 
(after  appropriate  processing)  to  the  two  terminals  during  the 
second  slot.  PNC-based  strategies  capable  of  supporting  more 
than  just  two  source  terminals  over  the  TWRC  may  be  found 
in  [4], 

The  transmission  schedules  for  LNC  and  PNC  are  illustrated 
in  Fig.  1 .  The  source  terminals  Afi  and  A 4  transmit  messages 
Ui  and  U2,  respectively,  where  each  message  is  a  packet 
containing  many  information  bits.  The  messages  are  (channel) 
encoded  and  modulated  by  the  function  L ,9  ( • ) .  In  the  case  of 
LNC,  the  two  messages  are  sent  in  orthogonal  time  slots,  while 
in  the  case  of  PNC,  they  are  sent  to  the  relay  at  the  same  time 
over  a  multiple-access  channel  (MAC).  For  both  LNC  and 
PNC,  the  relay  broadcasts  the  encoded  and  modulated  signal 
IV(uj  in  the  final  time  slot,  where  u  is  the  network  codeword 
and  TR(-)  is  the  function  used  by  the  relay  to  encode  and 
modulate  the  network  codeword.  Using  the  received  version 
of  L/Fu)  and  knowledge  of  its  own  message,  each  terminal 
is  able  to  estimate  the  message  sent  by  the  other  terminal. 

There  are  several  options  for  implementing  PNC.  The  relay 
may  simply  amplify  and  forward  the  signal  received  from  the 
end  nodes,  without  performing  demodulation  and  decoding. 
This  PNC  scheme  is  referred  to  as  analog  network  coding 
(ANC)  in  [5]  and  PNC  over  an  infinite  field  (PNCI)  in  [6], 
Another  option  is  for  the  relay  to  perform  demodulation  and 
decoding  in  an  effort  to  estimate  the  network  codeword,  which 
is  remodulated  and  broadcast  to  the  terminals.  This  scheme 
is  simply  called  PNC  in  [2]  and  PNC  over  a  finite  field 
(PNCF)  in  [6],  but  in  this  paper  we  refer  to  it  as  digital 
network  coding  (DNC)  to  distinguish  it  from  ANC.  Under 
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many  channel  conditions,  DNC  offers  enhanced  performance 
over  ANC.  This  is  because  the  decoding  operation  at  the  relay 
helps  DNC  to  remove  noise  from  the  MAC  phase,  while  the 
noise  is  amplified  by  the  relay  when  ANC  is  used.  However, 
ANC  avoids  the  computational  complexity  of  demodulation 
and  decoding  at  the  relay. 

Symbol  timing  is  a  critical  consideration  in  systems  em¬ 
ploying  PNC.  Synchronization  of  the  clocks  and  packet  trans¬ 
missions  at  the  two  source  nodes  can  be  achieved  by  network 
timing  updates.  These  updates  are  routine  in  networks  with 
scheduling  mechanisms,  such  as  cellular  networks.  When  the 
propagation  times  of  the  signals  from  the  sources  differ,  the 
symbols  arrive  at  the  relay  misaligned.  The  timing  offset  is 
t  =  Ad/c,  where  c  is  the  speed  of  light,  and  is  the 
difference  in  link  distances  from  the  sources  to  the  relay.  For 
insignificant  delay,  we  need  r  <<  Ts/ 2,  where  Ts  denotes 
the  symbol  period.  This  constraint  limits  the  symbol  rate.  As 
an  example,  assume  =  300  meters.  Then,  Ta  >>  2  /is  is 
required,  and  the  symbol  rate  is  limited  to  250  kilosymbols/s. 
An  alternative  is  to  delay  the  transmission  of  the  node  closer 
to  the  relay  by  r.  However,  this  requires  tracking  the  distances 
between  the  sources  and  the  relay. 

A  common  assumption  made  in  the  PNC  literature  is 
that  the  signals  are  coherently  demodulated  and  that  perfect 
channel-state  information  (CSI)  is  available  at  the  receivers. 
For  instance,  decode-and-forward  relaying  has  been  consid¬ 
ered  for  binary  phase-shift  keying  [7]  and  minimum-shift 
keying  [8]  modulations,  but  in  both  cases  the  relay  must 
perform  coherent  reception.  An  amplify-and-forward  protocol 
is  considered  in  [9],  which  allows  the  decision  to  be  deferred 
by  the  relay  to  the  end-node,  though  detection  is  still  coherent. 
When  two  signals  arrive  concurrently  at  a  common  receiver, 
neither  coherent  detection  nor  the  cophasing  of  the  two 
signals  (so  that  they  arrive  with  a  constant  phase  offset)  is 
practical.  The  latter  would  require  preambles  that  detract  from 
the  overall  throughput,  stable  phases,  and  small  frequency 
mismatches.  To  solve  this  problem,  frequency- shift  keying 
(FSK)  was  proposed  for  DNC  systems  in  [10]  and  [11], 
A  key  benefit  of  using  FSK  modulation  is  that  it  permits 
noncoherent  reception,  which  eliminates  the  need  for  phase 
synchronization.  An  alternative  to  noncoherent  FSK  is  to  use 
differential  modulation,  which  has  been  explored  in  [12]. 

In  PNC  systems,  it  is  desirable  to  protect  the  data  with 
a  channel  code.  The  combination  of  channel  coding  and 
physical-layer  network  coding  is  considered  in  [13]  In  [11], 
we  investigate  the  use  of  a  binary  turbo  code  in  a  noncoherent 
DNC  system.  When  using  a  binary  turbo  code  in  a  DNC 
system,  the  relay  demodulator  must  be  able  to  produce  bitwise 
log-likelihood  ratios  (LLRs)  that  are  introduced  to  the  input 
of  the  channel  decoder. 

Channel  estimation  is  an  important  issue,  especially  when 
a  channel  code  is  used.  A  training-based  channel  estimation 
scheme  for  PNC  at  the  relay  assuming  amplify-and-forward 
operation  is  considered  in  [14].  The  relay  estimates  channel 
parameters  from  training  symbols  and  adapts  its  broadcast 
power  in  order  to  maximize  the  signal-to-noise  ratio  at  the  end 
nodes.  Estimation  of  both  channel  gains  in  the  two-way  relay 
channel  at  the  end  nodes,  rather  than  the  relay,  is  considered 
in  [15],  Novel  channel  estimators  are  presented  which  provide 


better  performance  than  common  techniques  such  as  least- 
square  and  linear-minimum-mean-squared  error  estimation.  In 
[16],  we  propose  a  blind  channel  estimator  for  the  relay  of  the 
noncoherent  DNC  system. 

In  this  paper,  we  investigate  receiver-design  issues  related 
to  the  use  of  noncoherent  FSK  in  DNC  systems.  While 
noncoherent  FSK  has  been  previously  proposed  for  DNC 
sy terns  in  [10],  we  make  the  following  specific  contributions: 

1)  We  provide  closed-form  expressions  for  the  relay  re¬ 
ceiver  decision  rule  with  different  types  of  CSI.  This 
is  in  contrast  with  [10],  which  resorted  to  numerical 
methods  to  solve  the  decision  rule  (see  the  comment 
below  equation  (8)  in  [10]). 

2)  We  consider  the  use  of  a  turbo  code  for  additional 
data  protection.  This  requires  that  the  relay  receiver  be 
formulated  so  that  it  produces  bitwise  LLRs,  which  may 
be  passed  through  a  standard  turbo  decoder. 

3)  We  provide  results  for  Rayleigh  block-fading  channels. 
The  results  in  [10]  were  only  for  a  phase-fading  channel. 

4)  We  propose  a  channel  estimator  that  is  capable  of 
determining  the  fading  amplitudes  of  the  channels  from 
the  two  terminals  to  the  relay.  The  estimator  does  not 
require  pilot  symbols. 

The  remainder  of  this  paper  is  organized  as  follows. 
Section  II  presents  the  system  model  used  throughout  the 
paper.  Section  III  derives  the  relay  receiver,  while  Section 
IV  discusses  channel-estimation  issues.  Section  V  provides 
simulation  results,  and  Section  VI  concludes  the  paper. 


II.  System  Model 


The  discrete-time  system  model  shown  in  Lig.  2  gives 
an  overview  of  the  processing  at  all  three  nodes.  Terminal 
Afi,i  £  {1,2},  generates  a  length- A'  information  sequence, 
u,  =  ...,  UitK\-  The  two  terminals  channel-encode 

and  modulate  their  information  sequences  using  the  function 
rs(-),  which  is  common  to  both  nodes.  A  rate-ri  turbo  code 
is  used,  and  the  resulting  length  Ls  =  K/r\  turbo  codeword 
generated  by  Mi  is  denoted  by  b,  =  [bi  i,  ...bi,Ls]  (not  shown 
in  the  diagram).  The  signal  transmitted  by  node  Mi  during 
signaling  interval  kTs  <  t  <  (k  +  1)TS  is 


Si(t) 


2  M/Ci  +  ^ 


(t  -  kTs) 


(1) 


where  £,  is  the  transmit  energy,  fCi  is  the  carrier  frequency 
of  node  Mi  (in  practice,  the  carrier  frequencies  of  the  two 
nodes  are  not  necessarily  the  same),  and  Ts  is  the  symbol  pe¬ 
riod.  Note  that  (1)  is  continuous-phase  frequency-shift  keying 
(CPLSK)  with  a  unity  modulation  index,  which  is  orthogonal 
under  noncoherent  demodulation  and  has  a  continuous  phase 
transition  from  one  symbol  to  the  next  [17],  The  orthogonally- 
modulated  signal  Si  (t)  may  be  represented  in  discrete  time  by 
the  2  x  L$  matrix  X,;  =  [x,.i,  with  kth  column 


X-i,k 


1  0  ]T  if  bi:k  =  0 
0  1  }T  if  bitk  =  1. 


For  the  DNC  system,  the  signals  are  transmitted  simulta¬ 
neously  by  the  two  source  nodes  over  a  MAC  channel.  The 
relay  receives  the  noisy  electromagnetic  sum  of  interfered  and 
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Fig.  2.  Discrete-time  system  model. 


faded  signals,  Y,  and  applies  the  demodulation  and  channel¬ 
decoding  function  F^1  (■).  The  demodulation  operation  yields 
a  soft  estimate  of  the  network-and-channel-coded  message 
b  =  bi  ©  b2  (not  shown),  while  the  channel-decoding  op¬ 
eration  yields  a  hard-decision  on  the  network-coded  message 
u  =  ui  ©  U2.  With  the  LNC  system,  the  two  sources  transmit 
during  orthogonal  time  slots.  The  received  versions  of  Xi  and 
X2  are  demodulated  independently  to  provide  soft  estimates 
of  bi  and  b2.  These  soft  estimates  are  combined  and  turbo 
decoded  to  yield  a  hard  estimate  of  u.  The  key  distinction 
between  DNC  and  LNC  is  that  with  the  DNC  system,  the 
estimate  of  b  is  obtained  directly  from  Y,  while  with  LNC  it 
is  found  by  independently  demodulating  the  two  source  signals 
and  then  combining  them. 

During  the  broadcast  phase,  the  relay  encodes  and  mod¬ 
ulates  u  using  the  function  r#(-),  which  may  be  different 
than  the  function  Fs(-)  used  by  the  sources.  The  channel 
code  applied  by  the  relay  is  a  rate-r2  turbo  code,  yielding 
a  length  Lr  =  K/r2  turbo  codeword.  The  code  rates  n 
and  r2  used  by  the  sources  and  relays,  respectively,  do  not 
need  to  be  the  same.  In  the  simulation  results,  we  contemplate 
using  a  stronger  code  for  the  MAC  phase  than  the  broadcast 
phase,  i.e.  n  <  r2-  The  relay  broadcasts  its  encoded  and 
modulated  signal,  which  may  be  represented  in  discrete-time 
by  the  2  x  Lr  matrix  X.  The  signal  traverses  two  independent 
fading  channels,  and  the  end  nodes  receive  independently 
faded  versions  of  X:  Zi  at  M\  and  Z2  at  A/2.  The  end 
nodes  demodulate  and  decode  their  received  signals  using 
the  function  L^1  (■),  and  form  estimates  of  u.  Let  u  denote 
the  estimate  at  Mi  and  u  denote  the  estimate  at  A©.  Next, 
estimates  of  the  transmitted  information  messages  are  formed, 
U2  =  u  ®  Ui  at  M\  and  Ui  =  u  ©  U2  at  A/2.  Since  the 
links  in  the  broadcast  phase  are  conventional  point-to-point 
links,  specific  details  of  the  receiver  formulation  will  not  be 
presented  here.  A  detailed  exposition  of  receiver  design  for 
turbo-coded  CPFSK  systems  in  block  fading  channels  can  be 
found  in  [18]. 

All  of  the  channels  in  the  system  are  modeled  as  block- 
fading  channels.  A  block  is  defined  as  a  set  of  N  symbols 
that  all  experience  the  same  fading  amplitude.  The  duration 
of  each  block  corresponds  roughly  to  the  channel  coherence 
time.  Ideally  both  sources  transmit  with  the  same  carrier 
frequency  fCl  =  fC2.  However,  due  to  instabilities  in  each 
source  node’s  oscillator  and  different  Doppler  shifts  due  to 
independent  motion,  it  is  not  feasible  to  assume  that  these 
two  frequencies  are  the  same  at  the  relay  receiver.  At  best, 
the  relay  receiver  could  lock  onto  one  of  the  two  frequencies, 
in  which  case  the  received  phase  of  the  other  signal  would 


drift  from  one  symbol  to  the  next.  To  model  this  behavior, 
we  let  the  phase  shift  within  a  block  vary  independently  from 
symbol  to  symbol. 

The  signal  matrix  X,  transmitted  by  node  A/i  may  be 
partitioned  into  Nb  =  Lr/N  blocks  according  to 


X,  = 


(3) 


where  each  block  X.[  ,  1  <  i  <  Nb,  is  a  2  x  N  matrix,  and 
Nb  is  assumed  to  be  an  integer.  The  channel  associated  with 
block  is  represented  by  the  N  x  N  diagonal  matrix 

x  diag(exp{j0l^1)},  ...,exp{j^})  (4) 

where  a)  is  a  real-valued  fading  amplitude  and  8\  j,  is  the 
phase  shift  of  the  kth  symbol.  The  {9^1}  are  independent  and 

identically  distributed  over  the  interval  [0,  2n).  The  {a^}  are 
normalized  so  that  Et  represents  the  average  energy  of  terminal 
Mi  received  by  the  relay.  The  Ith  block  at  the  sampled  output 
of  the  relay  receiver’s  matched-filters  is  then 

YW  =  +NW  (5) 

where  is  a  2  x  N  noise  matrix  whose  elements  are  i.i.d. 
circularly-symmetric  complex  Gaussian  random  variables  with 
zero  mean  and  variance  Nq. 


III.  Relay  Receiver 

At  the  relay,  each  block  Y^Z  of  the  channel  observation 
matrix  Y  is  passed  to  a  channel  estimator,  which  computes 
estimates  of  the  aq  and  ot2  ■  A  full  description  of  the  esti¬ 
mator  is  given  in  Section  IV.  The  fading-amplitude  estimates 
and  channel  observations  are  used  to  obtain  soft  estimates  of 
the  network-and-channel-coded  sequence  b.  The  demodulator 
operates  on  a  symbol-by-symbol  basis,  and  therefore  we 
may  focus  on  a  single  signaling  interval  by  dropping  the 
dependence  on  the  symbol  interval  k  and  the  block  index  l. 
Let  bi  and  62  be  the  turbo-coded  bits  transmitted  by  terminals 
Mi  and  M2,  and  let  b  =  b\  ©62  be  the  corresponding  network- 
coded  bit.  The  relay  demodulator  computes  the  LLR 


m 


log 


P(b  =  i|y) 
P(b  =  0|y) 


P(bi  ®b2  =  l|y) 
P(bi  ©  b2  =  0|y) 


(6) 


where  y  is  the  corresponding  column  of  Y.  The  event  {61  © 
b2  =  1}  is  equivalent  to  the  union  of  the  events  {61  =  0,  b2  = 
1}  and  {61  =  1,  b2  =  0}.  Similarly,  the  event  {61  ©  b2  =  0} 
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is  equivalent  to  the  union  of  the  events  {61  =  0,  b2  =  0}  and 
{61  =  1,  62  =  1}.  It  follows  that 


A  (b)  =  log 


P  ({61  =  0,  b2  =  1}  U  {61  =  1,  b-2  =  0} |y) 


=  log 


P  ({61  =  0,  b2  =  0}  U  {hi  =  1,  b2  =  l}|y) 

P  ({61  =  0 ,b2  =  l}|y)  +  P  ({hi  =  1  M  =  o}|y) 
P  ({hi  =  0,  h2  =  0}  |y)  +  P  ({hi  =  1,  h2  =  l}|y) 


(7) 


where  the  second  line  follows  from  the  first  because  the  events 
are  mutually  exclusive. 


A.  LNC  Receiver 


In  the  LNC  system,  the  LLR’s  of  hi  and  h2  are  first 
computed  independently  during  the  orthogonal  time  slots  and 
are  then  combined  according  to  the  rules  of  LLR  arithmetic. 
The  LLR  of  the  signal  sent  from  node  Hi  to  the  relay  is 


A(6i) 


log 


P(bj  =  l|y) 

P{h  =  0|y) 


(8) 


where  y  is  the  signal  received  during  the  time  slot  that  node 
H  transmits.  When  the  fading  amplitudes  on,i  =  1,2,  are 
known,  but  the  phases  9i,i  =  1,2,  are  not  known,  then  (8)  is 
found  using  [19] 

-  .og,0(^)_log7o(^) 

(9) 


where  Iq(-)  is  the  zeroth-order  Bessel  function  of  the  first 
kind  and  yi  and  y2  are  the  components  of  y.  If  the  fading 
amplitudes  are  not  known,  but  have  Rayleigh  distributions, 
then  (8)  is  found  using  [19] 

Aft)  ^  (10» 

Once  the  individual  LLR’s  from  each  end  node  are  found 
using  (9)  or  (10),  the  LLR  of  the  LNC  system’s  network 
codeword  can  then  be  found  from  (7)  and  the  independence 
of  b\  and  b2  when  y  is  given: 


eA(fti)  _|_  gA (b2) 

Mb)  =  log  x  +  eA(&1)+A(62) 

=  max*  [A(bi),  A(&2)]  -  max*  [0,  A(&i)  +  A(62)]  (11) 

where  max*[i,  y]  =  log(ex  +  ev). 


B.  PNC  Receiver 

In  the  PNC  system,  it  is  not  sensible  to  compute  A(6i)  and 
A(62)  separately.  Instead,  use  (7)  and  assume  that  the  four 
events  are  equally  likely  along  with  Bayes’  rule  to  obtain 

A(6)=log  [p  (y|{&i  =  0  ,b2  =  1})  +  P  (y|{&i  =  1,  b2  =  0})] 
-log[p(y|{&i  =0  ,b2  =  0})+p(y|{6i  =  l,b2  =  1})]. 

(12) 

The  computation  of  each  p  {y\{bi,b2})  by  the  PNC  relay 
receiver  given  various  levels  of  channel  state  information  is 
the  subject  of  the  remainder  of  this  section. 


1)  Coherent  PNC  Receiver:  When  the  fading  amplitudes 
and  phases  are  known,  p  (y | {61 ,  b2})  is  conditionally  Gaus¬ 
sian.  The  mean  is  a  two-dimensional  complex  vector  whose 
value  depends  on  the  values  of  {61,  &2}  and  the  complex  fading 
coefficients  {/ii,/i2},  which  are  the  corresponding  entries  of 
the  H  matrix.  Let  m[bi,b2\  be  the  mean  of  y  for  the  given 
values  of  bi  and  b2.  When  b\  7^  &2,  the  two  terminals  transmit 
different  frequencies  and 

m[0, 1]  =  [hi  h2  ]T 

m[l,  0]  =  [  h2  M  ]T  (13) 

When  b\  =  bi,  the  two  terminals  transmit  the  same  frequency 
and 

m[0,  0]  =  [  (hi  +  h2)  0  ] 1 

m[l,  1]  =  [0  (hi  +  h2)  ]T  (14) 

Since  there  is  a  one-to-one  correspondence  between  the  event 
{&i,  b2}  and  the  mean  vector  m[6i,  62],  it  is  equivalent  to  write 
P  (y|{bi,  b2})  as  p  (y|m[6i,  b2}),  where 

p(y\m[bi,b2})  =  exp|-^-||y-m[6i,62]||2|. 

(15) 

The  coherent  receiver  computes  each  of  the  p  (y|{&i,  b2}) 
required  by  (12)  by  substituting  the  corresponding  m[bi,b2] 
defined  by  (13)  and  (14)  into  (15). 

2)  Noncoherent  PNC  Receiver  with  CS1:  Suppose  that 
the  receiver  does  not  know  the  phases  of  the  elements  of 
the  complex- valued  m[&i,62]  vectors,  but  does  know  the 
magnitudes  of  the  elements.  The  knowledge  of  the  magnitudes 
constitutes  a  type  of  channel-state  information  (CSI).  Define 
n[bi,  b2\  to  be  the  two-dimensional  real  vector  whose  elements 
are  the  magnitudes  of  the  elements  of  the  complex  vector 
m[&i,  b2\.  When  bi  7^  b2,  both  frequencies  are  used,  and 

M[0, 1]  =  [  \hi |  \h2\  ]T  =  l  a2  ]T 

M[1,0]  =  [  \h2\  \hi\  ]T  =  [  a2  Oil  ]T  (16) 

When  b  \  =  b  >,  only  one  frequency  is  used,  and 

M0,0]  =  [\hi+h2\  0]r=[a  0  ]T 

M[l,l]  =  [0  \hi  +  h2\]T=[0  a  ] T  (17) 

where  a  =  \hi  +  h2\  =  ■ 

The  pdf  of  y  conditioned  on  fi[bi,b2]  may  be  found  by 
marginalizing  over  the  unknown  phases 

p(y\v[bi,b2\)  =  /  p(^i,^2)p(y|m[6i,62])#i#2- 

J  0  ^0 

(18) 

where  <j>i  and  (j)2  are  the  phases  of  the  first  and  second 
elements  of  m[6i,62],  respectively. 

Assume  that  the  ai  are  Rayleigh  distributed  so  that  the 
hi  are  circularly- symmetric  zero-mean  complex  Gaussian*. 

*The  receiver  derived  in  this  subsection  is  valid  even  for  non-Rayleigh  fad¬ 
ing,  provided  that  the  received  phases  over  the  two  channels  are  independent 
and  uniform  over  (0,  2tt). 
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When  b\  ^  b2  each  element  of  m  [61,62]  is  a  circularly- 
symmetric  zero-mean  complex  Gaussian  and  therefore  has 
uniform  phase.  On  the  other  hand,  when  61  =62,  one  element 
is  6.1  +  62,  which  is  the  sum  of  two  circularly-symmetric  zero- 
mean  complex  Gaussians,  while  the  other  element  is  zero. 
Since  the  sum  of  two  circularly-symmetric  complex  Gaussians 
is  also  a  circularly-symmetric  complex  Gaussian,  it  follows 
that  61  +  62  is  a  zero  mean  circularly-symmetric  complex 
Gaussian  and  therefore  its  phase  is  uniform.  Since  the  other 
element  is  zero,  its  phase  is  irrelevant  and  may  be  set  to  any 
arbitrary  distribution,  which  is  most  conveniently  chosen  to 
be  uniform.  Thus,  it  follows  that  <f>\  and  cf> 2  are  i.i.d.  uniform. 
Therefore,  the  pdf  conditioned  on  the  magnitudes  is 


V  (y  I  A4  [61, 62])= 


nNc 


f*2ir 

exp 


0  Jo 


\yi  -  Ml  [61, 62] e^1 12 


Nn 


d(j)  1 


ttNi 


f27T  /  1 1/2  M2  [^1  ,  62]  |  2 

exp  < - 


0  Jo 


No 


#2  (19) 


where  Mfe  [61 , 62]  is  the  kth/  element  of  ^[61,62]  and 


1  [2n 
27T  ,/n  6XP 


I Vk  -  Pk[bi,b2]eJ<l,k\2 


No 


d(j>k 


=  exp 


\yk\2  +  (Hk[h,b2))2\  / 2|2/fc|Mfc[6i,  62] 


No 


No 


(20) 


3)  Noncoherent  PNC  Receiver  without  CSI:  Suppose  that 
besides  not  knowing  the  phases  9i,92l  the  relay  receiver  does 
not  know  the  magnitude  vector  p[bi,b2\.  Then,  the  relay  must 
operate  without  any  channel  state  information  except  for  the 
average  energies  £\ .  £2  and  the  noise  variance  Nf>.  When  the 
magnitudes  /x[&i ,  6*2]  are  not  known,  then  the  conditional  pdf 
is  found  by  marginalizing  (21)  over  the  unknown  magnitudes 


poo  poo 

p(y\{bi,b2})  =  /  p(pi,  p2)p  (y | /x [6*1 , 62] )  dpidp2. 

Jo  Jo 


(25) 


where  p\  and  p2  are  the  magnitudes  of  the  first  and  second 
elements  of  /x  [61,62],  respectively. 

According  to  (16),  when  b\  ^  b2,  one  of  the  pk  =  ot\  while 
the  other  ///,.  =  a2.  Since  «i  and  a2  are  independent  and  each 
cii  is  Rayleigh  with  energy  £,,  it  follows  that  the  joint  pdf  of 
Mi  and  p2  when  (61,62)  =  (0,1)  is 


p(Mi,  M2) 


(26) 


for  Mi,M2  >  0,  and  when  (61, 62)  =  (1,0)  it  is 


p(Mi,  M2) 


(27) 


Substituting  (20)  into  (19), 


>{y\fj.[bi,b2])  =  /3]^[expi- 


fc=  1 


{Pk[bi,b2])2 
No 


Xlr 


where 


K  =  Ij^l  «Pt- 


/  Z\yk\Pk[bi,b2\ 

1  No 

\yi\2  +  \y2\2 


No 


(21) 


(22) 


which  is  common  to  all  four  {61, 62}  and  will  therefore  cancel 
in  the  LLR  (12). 

For  each  event  {61,  62},  substitute  the  p  (y  |/x  [61 , 62])  given 
in  (21)  with  the  m[6i  ,  62]  given  by  (16)  and  (17)  as  the 
corresponding  p  (y|{6i,  62})  in  (12).  This  results  in 


A(6)  =  log 


e-ocl/NoT 


+e-“2/Jvo/o 


2a2\y 


-  log 


e-a2/JV0J 


No 

(  2a|t/i 

V  No 


2qi|mi 
No 

l|\  e-«?/JVo  r 


e-a2/lVo  IQ 

2ai\y2 


f  2.a2\y2 


V  No 


+  e"“2/JVo/c 


N0 

f  2.a\y2 


V  No 


(23) 


As  discussed  in  Section  IV,  it  is  possible  to  accurately  esti¬ 
mate  cti  and  02  in  the  considered  block  fading  environment, 
provided  the  blocks  are  sufficiently  long.  However,  it  is  not 
generally  feasible  to  precisely  estimate  a  because  the  phases 
6\  and  02  are  varying  on  a  symbol-by-symbol  basis.  Since 
E\cos{62  —  )]  =  0,  a  reasonable  approximation  when  an 

estimate  of  a  is  not  available  is  to  use 

a  ~  \]  a\  +  a2-  (24) 


for  Mi,  M2  >  0-  Substituting  (26)  and  (21)  into  (25)  yields 


p{y\{h  =  0,62  =  1})  = 


\yi 


\V2 


Nl 

£1 


■No 


N2 
£ 2 


■No 


4”  log 


1 


Similarly,  substituting  (27)  and  (21)  into  (25)  yields 


(28) 


p{y\{h  =  1,62  =  0})  = 


\yi 


\V2 


Nl 

£2 


No 


N2 

£1 


■No 


4"  log 


1 


(29) 


As  indicated  by  (17),  when  61  =  b2,  one  of  the  p,k  =  ct 
while  the  other  Mfc  =  0-  As  discussed  below  (18),  in  a 
Rayleigh-fading  environment,  hi  and  h2  are  independent, 
complex-valued,  circularly-symmetric  Gaussian  variables,  and 
therefore  6  =  61  +  62  is  also  a  complex-valued,  circularly- 
symmetric  Gaussian  variable.  It  follows  that  a  =  |6|  is 
Rayleigh  with  energy  £\  +  £2,  and  the  pdf  of  the  nonzero 
Mfe  is 


p{Pk) 


2  Mfe 

E\  +  £2 


Mfe  1 
£i  +  £2J  ’ 


Mfc>«.  (30) 


For  the  pk  =  0,  its  pdf  may  be  represented  by  an  impulse  at 
the  origin,  i.e.  p{pk)  =  8{pk)-  Substituting  these  pdfs  with 
the  appropriate  p[bi,b2]  into  (25)  yields 


p(y\{bi,b2})  =  log 


1 


£1  4-  £2 

\yi\2 


1 


£\  4-  £2 


1 

Wo 


Nn 


£l~\~£2 


■No 


-1 


(31) 
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where  i  =  1  when  (foi ,  62)  =  (0,  0)  and  i  =  2  when  (&i,  b2)  = 
(1,1)- 

Substituting  (28)  and  (29)  for  the  two  b\  ^  b2  and  (31)  for 
the  two  b\  =  b2  into  (12)  yields 


A  (b)  =  log 
+  log 
-  log 


exp  s  - 


£l$2 

HN0 

\ml 

Z 


1 2/2 


No 


exp  \  - 


\ml 

Ci  £2 


exp  <  - 


exp  <  - 


ll/ll2  _  \V2_ 
N0  C 
|yi|2  \V2? 


2  >  1 


£2  Ci 


(32) 


where  Ci  =  £i  +  N0,  C2  =  £2  +  N0,  and  £  =  £1  +  S2  +  N0. 


IV.  Channel  Estimator 

The  goal  of  the  channel  estimator  is  to  estimate  the  values 
of  the  fading  amplitudes  ot\  and  a2  for  a  particular  fading 
block.  Let  the  fading  amplitudes  of  a  block  be  represented 
by  the  pair  {A,  B},  where  A  >  B.  Thus,  A  =  maxjai,  a2} 
and  B  =  min{ai,  a2}.  Note  that  in  (23),  exchanging  ot\  and 
a2  does  not  change  the  final  expression.  Therefore  (23)  is 
commutative  in  «i  and  a2 ,  and  may  be  written  as 


A(6)  =  max  * 


F 


(2A\yi\\  ,  „{2B\m\ 


F  , 

V  No 
(2B\yi 


—  max* 


V  No 
2^A2  +  B2\yi 
An 


■  F 


V  No 
(2A\y2\ 

V  N0 

/2V/  A2  +  B2\y2\ 


,F 


No 


(33) 


where  the  approximation  a  ~  \J a2  +  0/  has  been  used  and 
F(x)  =  log[/o(a;)],  which  may  be  efficiently  and  accurately 
computed  through  the  following  piecewise  polynomial  fit: 


F(x )  =  log[/0(®)]  « 

'  0.22594®2  +  0.012495®  -  0.0011272 
0.12454®2  +  0.21758®  -  0.10782 
0.028787®2  +  0.63126®  -  0.56413 
0.003012®2  +  0.88523®  -  1.2115 
<  0.00053203®2  +  0.95304®  -  1.6829 

0.00013134®2  +  0.97674®  -  2.0388 
0.9943®  -  2.6446 
0.99722®  -  3.0039 
,  0.99916®  -  3.6114 


0  <  x  <  1 

1  <  x  <  2 

2  <  x  <  5 
5  <  x  <  15 
15  <  ®  <  30 
30  <  x  <  60 
60  <  ®  <  120 
120  <  ®  <  500 
®  >  500. 

(34) 


A.  Fading  Amplitude  Estimator 

To  estimate  A  and  B,  first  add  the  two  elements  of  each  y, 
to  obtain 


n  =  Hi,  1  +  yi,  2  =  hit  1  +  hiy2  +  m,i  +  nit2  (35) 

Si  v  ✓ 

Vi 

where  vt  is  circularly-symmetric  complex  Gaussian  noise  with 
variance  2Nq,  and  h^k  is  the  channel  coefficient  between 
terminal  A4,  fc  =  {1,2},  and  the  relay  during  the  ith  signaling 
interval.  The  signal  r,;  is  the  noisy  sum  of  two  complex  fading 


coefficients,  and  therefore  the  fading-amplitude  estimation 
algorithm  proposed  by  Hamkins  in  [20]  may  be  used.  To 
determine  the  values  of  A  and  B,  a  system  of  two  equations 
with  two  unknowns  is  required.  The  first  equation,  found  by 
taking  the  expected  value  of  \rt\2  under  the  assumption  that 
the  fading  amplitudes  are  fixed  for  the  block  in  question,  is 

E  [|ri|2]  =  E[ot\+al+2aia2cos{Qi,2-Qiti)\ 

=  E  [a\  +al]  =  a\  +  a\  =  A2  +  B2.  (36) 


The  second  equation  is  found  by  conditioning  on  the  event 
{|r|2  >  A2+B2},  which  is  equivalent  to  {cos(0i.2— 6^1)  >  0} 
and  has  expected  value  [20] 

4AB 


E 


>  A2 +  B2 


=  A 


B 


-■  (37) 


Solving  (36)  and  (37)  for  A  and  B  yields 


A  =  i 
2 


B  =  — 

2 


(1 

lx  +  l<Y-X)  +  S 

/x+|(X-F)j 

(1 

/*  +  f(y-x)-v 

/x+|(X-F)j 

( 

(38) 


>  A2  +  B2 


where  X  =  E  [|r|2]  and  Y  =  E 

Since  the  expected  values  required  for  (38)  are  not  known, 
they  may  be  estimated  by  using  the  corresponding  statistical 
averages,  1  n 

£n2 

i= 1 

E  N2  (39) 

i:\ri\2>X 


1 

X  = 

N 

2 

F  = 

N 

where  N  is  the  size  of  the  fading  block  and  the  factor  2/N 
used  to  compute  Y  assumes  that  j  rt  \ 1  >  X  for  approxi¬ 
mately  N/2  symbols.  If  this  assumption  is  not  true,  then 
the  multiplication  by  2/N  can  be  replaced  with  a  division 
by  the  number  of  samples  that  satisfy  \ri\2  >  X.  As  an 
alternative  to  summing  over  the  |ri|2  >  X,  Hamkins  proposes 
summing  over  those  r,  1 2  greater  than  the  median  value  of 
{|n|2,...,M2}[20]. 

The  estimator  works  by  computing  estimates  X  and  Y  using 
(39)  and  the  {7T,...,r/v}  for  the  block.  These  estimates  are 
used  in  place  of  X  and  Y  in  (38),  which  yields  estimates  A 
and  B  of  A  and  B.  These  estimates  are  then  used  in  place  of 
A  and  B  in  (33). 


B.  Transmission-Case  Detection 

According  to  (35),  the  two  elements  of  y.t  are  always  added 
together.  When  bi  =  b2,  only  one  tone  is  used,  and  the  noise 
can  be  reduced  if  the  receiver  processes  only  the  tone  used  and 
ignores  the  other  tone.  This  requires  that  the  receiver  be  able 
to  detect  whether  the  first  tone,  the  second  tone,  or  both  tones 
were  used,  which  may  be  implemented  using  a  variation  of  the 
"no-CSI”  receiver  described  in  subsection  III-B3.  In  [16],  we 
contemplate  an  estimator  that  uses  such  a  transmission-case 
detector.  However,  we  found  that  the  performances  with  and 
without  the  transmission-case  detector  were  virtually  identical 
and  do  not  consider  it  further  in  this  paper.  At  best,  proper  use 
of  the  transmission-case  detector  reduces  the  noise  variance 
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from  2 No  to  N0  during  the  symbol  intervals  that  both  nodes 
transmit  the  same  tone.  As  will  be  seen  in  the  numerical 
results,  the  estimator  is  resilient  enough  against  noise  that  this 
reduction  in  noise  variance  is  not  meaningful  and  does  not 
justify  the  additional  complexity. 

C.  Amplitude  Estimation  for  Single-Transmitter  Links 

During  the  broadcast  phase,  there  is  only  a  single  trans¬ 
mission,  and  the  dual-amplitude  estimator  described  in  sub¬ 
section  IV-A  is  not  necessary.  Similarly,  the  estimator  is  not 
needed  by  the  LNC  system  during  the  MAC  phase  since  the 
two  transmissions  are  over  orthogonal  channels.  To  estimate 
the  fading  amplitudes  for  the  links  involving  only  a  single 
transmitter  and  receiver,  the  simple  averaging  technique  given 
by  (29)  in  [21]  is  used,  which  is  described  as  follows.  Consider 
the  ith  signaling  interval  during  the  Ith  fading  block.  Given 
transmission  of  tone  k,  in  the  absence  of  noise,  the  kth 
matched-filter  output  at  the  receiver  is  y k,i  =  ae’Si,  and  has 
magnitude  \yk,i\  =  cu  All  other  matched-filter  outputs  in  the 
ith  signaling  interval  are  0.  An  estimate  could  be  formed  by 
taking  the  maximum  \yk,i\  over  any  column  of  Y/: .  In  the 
presence  of  noise,  an  estimate  of  a  can  be  formed  by  averaging 
across  all  columns  of  the  fading  block 

1  N 

a  =  —  V'max|2/fei|.  (40) 

iv  z '  k 

i=l 

V.  Simulation  Study 

This  section  presents  simulated  performance  results  for  the 
relay  receiver  described  in  Section  III.  The  simulated  link 
model  is  as  described  in  Section  II,  with  specific  simulation 
parameters  given  in  the  following  subsections.  The  goal  of 
the  simulations  is  to  compare  the  performance  of  comparable 
DNC  and  LNC  systems  and  to  assess  the  robustness  of  the 
channel  estimator  proposed  in  IV.  Because  the  relay-broadcast 
phase  of  the  DNC  and  LNC  systems  operate  in  exactly  the 
same  manner  and  have  the  same  performance,  we  only  focus 
on  the  performance  of  the  MAC  phase. 

A.  Uncoded  Performance  with  Perfect  Channel  Estimates 

We  initially  consider  a  system  that  does  not  use  an  outer 
error-correcting  code,  and  thus  b,  =  u, ,  i  =  1 , 2.  We  compare 
the  performance  of  the  LNC  and  DNC  systems.  With  the  LNC 
system,  the  two  nodes  transmit  their  messages  in  orthogonal 
time  slots  and  the  relay  receiver  first  generates  the  individual 
LLR’s  during  each  time  slot  using  either  (9)  or  (10),  and 
then  the  two  LLR’s  are  combined  using  (11).  When  there 
is  no  outer  error-correcting  code,  performance  using  (9)  is 
approximately  the  same  as  that  using  (10).  A  bit  error  is 
declared  at  the  relay  whenever  a  hard  decision  using  (11) 
results  in  an  erroneous  decision  on  the  corresponding  bit  of  the 
network  codeword  b.  Such  an  error  will  usually  occur  if  one 
of  the  two  bits  b  \ ,  62  is  received  incorrectly,  and  therefore  the 
error  rate  of  the  LNC  system  is  approximately  Pb  ~  2p(l—p) 
where  p  is  the  bit  error  rate  of  noncoherent  binary  FSK 
modulation  [17]. 

With  the  DNC  system,  the  two  nodes  transmit  simultane¬ 
ously,  and  the  relay  receiver  computes  the  LLR  using  (23) 


Fig.  3.  Bit  error  rate  at  the  relay  in  Rayleigh  fading  when  DNC  and  LNC  is 
used  and  £ 2  =  £1 .  Depending  on  the  amount  of  channel  state  information  that 
is  available,  the  PNC  system  will  use  one  of  three  different  relay  receivers. 


when  the  magnitudes  [61,62]  are  known  or  (32)  when  they 
are  not.  A  hard  decision  is  made  on  the  LLR  and  a  hit 
error  is  declared  if  the  estimate  of  the  corresponding  network 
codeword  bit  b  is  incorrect.  We  assume  that  the  channel 
estimates  are  perfect,  and  since  there  is  no  error-correction 
coding,  the  size  of  the  fading  block  is  irrelevant  provided  that 
the  channel  coherence  time  is  not  exceeded. 

Initially,  we  set  the  average  received  energy  to  be  the  same 
over  both  channels,  i.e.  £2  =  £1  =  £s  =  £f>-  Fig.  3  shows  the 
performance  of  the  LNC  and  DNC  systems  in  Rayleigh  fading 
with  equal  energy  signals.  As  anticipated,  the  LNC  system 
offers  the  best  performance,  which  is  approximately  3  dB 
worse  than  a  standard  binary  CPFSK  system  with  noncoherent 
detection  (the  loss  relative  to  conventional  CPFSK  is  due  to  the 
fact  that  both  bits  must  usually  be  received  correctly).  Three 
curves  for  the  DNC  system  are  shown  in  Fig.  3,  corresponding 
to  receivers  that  exploit  different  amounts  of  available  channel 
state  information.  The  best  performance  is  achieved  using  a 
receiver  implemented  with  (23),  which  requires  knowledge 
of  ai,a2,  and  a.  The  performance  of  the  DNC  system 
implemented  with  (23)  is  only  about  0.25  dB  worse  than  that 
of  the  LNC  system.  The  worst  performance  is  achieved  using 
a  receiver  implemented  using  (32),  which  does  not  require 
knowledge  of  the  fading  amplitudes.  The  loss  due  to  using 
(32)  instead  of  (23)  is  about  10  dB,  indicating  that  estimating 
the  fading  amplitudes  at  the  relay  is  necessary. 

While  it  may  be  feasible  to  estimate  ot\  and  0:2,  estimating 
a  may  prove  to  be  more  difficult  because  it  will  depend  on 
not  only  the  individual  fading  amplitudes,  but  also  on  the 
phase  difference  between  the  two  channels.  Since  the  phase 
difference  might  change  more  quickly  than  the  individual 
amplitudes,  it  might  not  be  practical  to  estimate  a.  If  that 
is  the  case,  then  the  approximation  given  by  (24)  can  be  used 
in  place  of  the  actual  value  of  a.  The  performance  using  this 
technique  is  also  shown  in  Fig.  3  and  shows  a  loss  of  about  3 
dB  with  respect  to  the  known- fi[bi,  62]  system,  which  requires 
knowledge  of  a. 

The  performance  of  DNC  is  sensitive  to  the  balance  of 
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Fig.  4.  Bit  error  rate  at  the  relay  in  Rayleigh  fading  of  DNC  with  three 
different  receivers  and  either  82  =  8\  (solid  line)  or  82  =  4£i  (dashed 
line). 


power  received  over  the  two  channels.  Performance  is  best 
when  £\  =  £2-  In  order  to  evaluate  how  robust  the  DNC  relay 
receivers  are  to  an  imbalance  of  power,  the  simulations  were 
repeated  with  £2  =  4£i,  while  keeping  £{,  =  £s  =  {£\-\-£f) 
These  results  are  shown  in  Fig.  4  for  the  three  receiver 
formulations  that  were  considered  in  the  previous  figure. 
When  the  power  is  imbalanced  in  this  way,  there  is  a  loss 
of  about  2  dB.  However,  the  loss  is  the  same  for  all  three 
receiver  implementations,  suggesting  that  they  are  robust  to 
an  imbalance  of  power. 

B.  Uncoded  Performance  with  Channel  Estimation 

We  now  consider  the  influence  of  channel  estimation,  but 
still  assume  that  the  system  does  not  use  error-correction 
coding.  In  the  simulations,  the  information  frames  generated 
at  the  end  nodes  contain  I\  =  2048  bits  per  frame.  The  fading 
blocks  are  length  N  =  {8,  32, 128}  symbols  per  block.  The 
DNC  relay  implements  (32)  and  then  makes  a  hard  decision 
on  each  information  bit. 

The  bit  error-rate  performance  of  the  uncoded  system  is 
shown  in  Fig.  5.  The  performance  is  shown  with  the  estimator 
using  the  three  block  sizes  N  =  {8,32,128}  as  well  as  for 
the  case  of  perfect  estimates  of  a,  and  02.  A  narrow  range 
of  error  rates  is  shown  to  better  highlight  the  differences  in 
performance.  In  general,  smaller  fading  blocks  lead  to  a  less 
accurate  estimation  of  the  fading  amplitudes,  as  the  number 
of  samples  available  for  estimation  decreases.  Moving  from 
block  size  N  =  128  to  32  worsens  performance  by  roughly 
0.25  dB,  and  from  N  =  32  to  8  by  0.75  dB. 

C.  Performance  with  an  Outer  Turbo  Code 

Now  consider  a  system  that  uses  an  outer  turbo  code. 
The  terminals  each  encode  length  K  =  1229  information 
sequences  into  length  L  =  2048  codewords,  using  a  rate 
ri  ~  0.6  UMTS  turbo  code  [22].  The  relay  performs  turbo 
decoding  using  the  codeword  LLR’s  computed  by  (32).  The 


Fig.  5.  Influence  of  fading-block  length  N  on  uncoded  DNC  error-rate 
performance  at  the  relay.  In  addition  to  curves  for  three  values  of  N ,  a  curve 
is  shown  indicating  the  performance  with  perfect  fading-amplitude  knowledge. 


Fig.  6.  Influence  of  fading-block  length  N  on  turbo-coded  DNC  error- 
rate  performance  at  the  relay.  Two  curves  are  shown  for  each  value  of 
N  =  {8,  16,  32,  64,  128}.  Solid  curves  denote  perfect  fading-amplitude 
knowledge.  Dashed  curves  denote  estimated  fading  amplitudes. 

fading-block  lengths  simulated  are  N  =  {8,16,32,64,128} 
symbols  per  block. 

The  error  performance  of  the  coded  system  is  shown  in 
Fig.  6,  both  with  perfect  channel  estimates  and  with  estimated 
fading  amplitudes.  A  good  tradeoff  between  diversity  and 
estimation  accuracy  is  achieved  for  block  sizes  N  =  16 
and  N  =  32,  which  exhibit  the  best  performance  of  all 
systems  that  must  estimate  the  fading  amplitudes.  For  N  <  16 
performance  degrades  due  to  the  lack  of  enough  observations 
per  block  for  accurate  channel  estimates,  while  for  N  >  32 
performance  degrades  due  to  the  reduction  in  time  diversity. 

Fig.  7  shows  the  SNR  required  to  reach  an  error  rate  of 
10-4  at  the  relay  as  a  function  of  the  block  length  N.  In  each 
case,  information  is  coded  with  the  same  (2048, 1229)  turbo 
code  used  for  Fig.  6.  Curves  for  three  systems  are  shown:  The 
noncoherent  receiver  with  known  {a,,  0:2},  the  noncoherent 
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Fig.  7.  Signal-to-noise  ratio  required  to  reach  a  bit  error  rate  of  10— 4  at  the 
relay  as  a  function  of  fading-block  length.  The  performance  of  three  systems 
is  shown:  The  noncoherent  receiver  with  known  {ai,o:2}»  the  noncoherent 
receiver  with  estimated  {cci,  0:2},  and  the  noncoherent  receiver  that  does  not 
use  CSI.  All  systems  use  a  Turbo  code  with  rate  1229/2048. 


Fig.  8.  Comparison  of  error-rate  performance  between  the  turbo-coded  DNC 
and  LNC  systems  at  the  relay.  The  solid  lines  denote  DNC,  while  the  dashed 
lines  denote  LNC. 


receiver  with  estimated  {a\,  0:2 },  and  the  noncoherent  receiver 
that  does  not  use  CSI.  When  {13:1,0:2}  are  not  estimated, 
performance  improves  with  decreasing  N  because  of  the 
increased  number  of  blocks  per  codeword,  which  increases 
the  time  diversity.  However,  when  {01,02}  are  estimated, 
the  performance  gets  worse  when  the  block  size  is  smaller 
than  N  =  16.  The  loss  of  time  diversity  as  the  block  size 
increases  is  a  common  problem  for  any  system  operating  over 
a  slow-fading  channel,  and  the  system  proposed  in  this  paper 
is  no  exception.  The  performance  gap  between  the  known- 
CSI  and  no-CSI  receiver  formulations  widens  with  increasing 
block  length. 

An  error-rate  performance  comparison  between  DNC  and 
LNC  is  shown  in  Fig.  8.  Both  systems  use  the  same 
(2048, 1229)  turbo  code.  The  LNC  system  outperforms  the 


Fig.  9.  Comparison  of  the  performance  of  turbo-coded  DNC  and  LNC  at 
the  relay  with  block  size  N  =  32.  For  the  DNC  system,  two  code  rates  are 
shown,  with  the  lower  rate  code  offering  comparable  performance  to  the  LNC 
system. 

DNC  system  by  margins  ranging  between  4  and  6  dB. 

While  the  LNC  system  is  more  energy  efficient  than  the 
DNC  system  when  the  same-rate  turbo  code  is  used,  the 
throughput  of  the  LNC  system  is  worse  than  that  of  the  DNC 
system  because  the  two  terminals  must  transmit  in  orthogonal 
time  slots.  The  loss  in  energy  efficiency  from  using  DNC  ver¬ 
sus  LNC  can  be  recovered  by  having  the  source  terminals  use 
a  lower-rate  turbo  code.  Consider  the  performance  comparison 
shown  in  Fig.  9  for  block  size  N  =  32.  At  E^/Nq  ss  24  dB, 
DNC  using  a  rate  n  =  4500/6400  code  matches  the  error- 
rate  performance  of  LNC  using  a  rate  r\  =  4500/5056  code. 
Because  the  two  terminals  transmit  at  the  same  time,  the  end- 
to-end  throughput  of  DNC  is  higher  than  that  of  LNC,  even 
though  the  DNC  terminals  transmit  to  the  relay  with  a  lower- 
rate  channel  code. 

To  illustrate  the  throughput  improvement  of  DNC  over 
LNC,  consider  the  following  transmission  schedule  for  the 
two  systems.  Assume  the  source  terminals  use  rate  rq  = 
4500/6400  in  DNC,  and  n  =  4500/5056  in  LNC.  Assume 
operation  at  E^/Nq  =  24  dB,  yielding  approximately  equal 
relay  error-rate  performance.  Further,  assume  that  both  sys¬ 
tems  use  code  rate  r-2  =  4500/5056  for  relay  broadcast, 
yielding  approximately  equal  end-to-end  performance.  DNC 
requires  6400  channel  uses  for  transmission  to  the  relay  versus 
2  x  5056  =  10112  for  LNC.  Both  systems  require  5056 
channel  uses  for  relay  broadcast.  The  throughput  for  DNC 
is  thus  T^dnc)  =  9000/(6400+  5056)  =  9000/11,456  bits 
per  channel  use,  and  for  LNC  t(lnc>  =  9000/(3  x  5056)  = 
9000/15, 168  bits  per  channel  use.  The  percentage  throughput 
increase  of  DNC  over  LNC  is  thus  (j'(DNC'> /T(lnc')  —  1)  x 
100  «  32.4%. 

VI.  Conclusion 

A  throughput-improving  technique  for  relaying  in  the  two- 
way  relay  network,  digital  network  coding,  is  refined  for  prac¬ 
tical  operation.  The  system  operates  noncoherently,  providing 
advantages  over  coherent  operation:  there  are  no  requirements 
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for  perfect  power  control,  phase  synchronism,  or  estimates  of 
carrier-phase  offset. 

A  computationally  simple  technique  for  estimating  fading 
amplitudes  at  the  relay  is  implemented.  Error-rate  performance 
in  the  noncoherent  Rayleigh  block-fading  channel  at  several 
block  sizes  is  presented.  The  system  is  simulated  with  and 
without  an  outer  error-correcting  code.  The  coded  error-rate 
performance  of  the  system  using  estimation  differs  from  that 
with  ideal  estimates  by  margins  between  0.7  —  1.5  dB. 

When  the  same-rate  turbo  code  is  used,  digital  network 
coding  has  a  higher  throughput  but  lower  energy-efficiency 
than  link-layer  network  coding  .  The  energy  loss  of  DNC  can 
be  recovered  by  using  a  lower-rate  turbo  code  during  the  MAC 
phase.  Even  when  the  loss  of  spectral  efficiency  due  to  the 
lower-rate  turbo  code  is  taken  into  account,  the  DNC  system 
is  able  to  achieve  a  higher  throughput  than  LNC  at  the  same 
energy-efficiency.  In  the  particular  example  presented  in  this 
paper,  the  DNC  system  is  capable  of  achieving  throughputs 
that  are  32.4%  larger  than  that  of  the  equivalent  LNC  system, 
while  operating  at  the  same  energy  efficiency. 
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